THE INVENTION CLAIMED IS:

1. A method for categorization of an item comprising:
providing a plurality of categories organized in a hierarchy of categories;
providing a plurality of categorizers corresponding to the plurality of categories;
featurizing the item to create a list of item features;
using the list of item features in a categorizer system including the plurality of
categorizers for determining a plurality of levels of goodness;
using one of the plurality of levels of goodness for invoking an additional categorizer
of the plurality of categorizers as required;
categorizing the item in the categorizer system in the plurality of categories based on
the respective plurality of levels of goodness; and
returning the item categorized.
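
The steps of claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the names `Category`, `featurize`, `keyword_categorizer`, and `categorize`, the keyword-overlap scoring, and the `threshold` parameter are all assumptions introduced for illustration. The sketch walks a category hierarchy, uses each level of goodness to decide whether to invoke the categorizer of the next level, and returns the categories assigned to the item.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Set, Tuple

Features = List[str]
# A categorizer maps item features to a level of goodness per child category.
Categorizer = Callable[[Features], Dict[str, float]]


@dataclass
class Category:
    name: str
    children: List["Category"] = field(default_factory=list)
    categorizer: Optional[Categorizer] = None  # scores this node's children


def featurize(item: str) -> Features:
    """Hypothetical featurizer: lowercase whitespace tokens."""
    return item.lower().split()


def keyword_categorizer(keywords: Dict[str, Set[str]]) -> Categorizer:
    """Hypothetical scorer: fraction of item features that match a category's keywords."""
    def score(features: Features) -> Dict[str, float]:
        total = max(len(features), 1)
        return {cat: sum(f in kws for f in features) / total
                for cat, kws in keywords.items()}
    return score


def categorize(item: str, root: Category, threshold: float = 0.2) -> List[Tuple[str, float]]:
    """Descend the hierarchy while the best level of goodness meets the threshold."""
    features = featurize(item)
    node, path = root, []
    while node.children and node.categorizer is not None:
        goodness = node.categorizer(features)
        best = max(node.children, key=lambda c: goodness.get(c.name, 0.0))
        level = goodness.get(best.name, 0.0)
        if level < threshold:
            break  # goodness too low: no additional categorizer is invoked
        path.append((best.name, level))
        node = best  # the level of goodness triggers the next categorizer
    return path


sports = Category(
    "sports",
    [Category("soccer"), Category("tennis")],
    keyword_categorizer({"soccer": {"goal", "kick"}, "tennis": {"racket", "serve"}}),
)
root = Category(
    "root",
    [sports, Category("science")],
    keyword_categorizer({"sports": {"ball", "team", "goal"},
                         "science": {"atom", "cell", "energy"}}),
)

result = categorize("The team scored a goal", root)
print([name for name, _ in result])
```

Here the root categorizer's level of goodness for "sports" is high enough that the sports-level categorizer is invoked as well; an item with no matching features returns an empty list.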

2. The method as claimed in claim 1 wherein: 

using the list of item features determines the plurality of levels of goodness using a 
process to quantify the plurality of levels of goodness, to prioritize the
plurality of levels of goodness, and to resolve two levels of goodness into a 
third level of goodness. 

3. The method as claimed in claim 1 including: 

using a categorizer system knowledge base for determining the level of goodness for a
category with the list of item features.

4. The method as claimed in claim 1 including:

listing the plurality of categories and the respective levels of goodness on a list; and 
categorizing from the list. 

5. The method as claimed in claim 1 wherein: 

returning one category for the item among the plurality of categories selected from a
group consisting of the one category with the best level of goodness for all the
plurality of categories and with the best level of goodness for which
determining is completed where all of the plurality of categories are not 
compared. 

6. The method as claimed in claim 1 wherein: 

returning a plurality of categories for the item among the plurality of categories 
returns a plurality of categories selected from a group consisting of categories 
up to a fixed number of the plurality of categories, categories having more
than a fixed level of goodness, categories fulfilling a user specified preference,
categories not from a categorizer, and categories which are a combination
thereof.
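
Two of the selection options recited in claim 6 (a fixed number of categories, and categories above a fixed level of goodness) can be sketched as follows. The function name `select_categories` and its parameters `max_count` and `min_level` are hypothetical and are not taken from the claims; passing both parameters illustrates one possible combination.

```python
from typing import Dict, List, Optional, Tuple


def select_categories(
    goodness: Dict[str, float],
    max_count: Optional[int] = None,
    min_level: Optional[float] = None,
) -> List[Tuple[str, float]]:
    """Return categories ranked by level of goodness, optionally filtered."""
    # Rank categories from the best level of goodness to the worst.
    ranked = sorted(goodness.items(), key=lambda kv: kv[1], reverse=True)
    if min_level is not None:
        # Keep only categories having more than a fixed level of goodness.
        ranked = [(c, g) for c, g in ranked if g > min_level]
    if max_count is not None:
        # Keep at most a fixed number of categories.
        ranked = ranked[:max_count]
    return ranked


levels = {"sports": 0.7, "science": 0.1, "politics": 0.4}
print(select_categories(levels, max_count=2))
print(select_categories(levels, min_level=0.5))
```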

7. The method as claimed in claim 1 wherein:

returning the category for a plurality of items establishes a categorizer system
knowledge base for a topic hierarchy.

8. The method as claimed in claim 1 including:

listing a plurality of labels for each of the plurality of categories; and

training a categorizer system trainer using a plurality of items having known
categories and the plurality of labels to provide a categorizer system
knowledge base.

9. The method as claimed in claim 1 including:

providing a categorizer system knowledge base;

using a plurality of items with known categories to learn knowledge in the categorizer
system knowledge base.

10. The method as claimed in claim 1 including: 
providing a categorizer system knowledge base;

providing a plurality of categorizers, each using knowledge in a categorizer system
knowledge base and the list of item features to compute a degree of goodness 
for a plurality of categories, independent of other categorizers, each using a 
subset of item features to compute a degree of goodness for a plurality of 
categories, independent of other categorizers, and each subset independent of 
subsets used by other categorizers; and 

providing a mechanism to resolve the levels of goodness for a plurality of categories 
resulting from multiple categorizers into a combined level of goodness for a 
plurality of categories. 
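
As an illustration of claim 10, the sketch below runs several categorizers, each on its own disjoint subset of the item features, and resolves their levels of goodness into a combined level per category. The claims do not specify the resolution mechanism; the simple average used here, and the names `run_categorizers`, `combine_goodness`, and `count_scorer`, are assumptions made only for this example.

```python
from typing import Callable, Dict, List, Sequence, Set, Tuple

Features = List[str]
Scorer = Callable[[Features], Dict[str, float]]


def run_categorizers(
    features: Features,
    categorizers: Sequence[Tuple[Set[str], Scorer]],
) -> List[Dict[str, float]]:
    """Run each categorizer independently on its own subset of the features."""
    results = []
    for wanted, scorer in categorizers:
        subset = [f for f in features if f in wanted]
        results.append(scorer(subset))
    return results


def combine_goodness(per_categorizer: Sequence[Dict[str, float]]) -> Dict[str, float]:
    """Assumed resolution mechanism: average each category over all categorizers."""
    if not per_categorizer:
        return {}
    categories = set().union(*per_categorizer)
    n = len(per_categorizer)
    return {c: sum(r.get(c, 0.0) for r in per_categorizer) / n
            for c in sorted(categories)}


def count_scorer(category: str) -> Scorer:
    """Hypothetical scorer: goodness grows with the number of matching features."""
    return lambda subset: {category: min(1.0, len(subset) / 2)}


categorizers = [
    ({"goal", "team"}, count_scorer("sports")),   # feature subsets are disjoint
    ({"atom", "cell"}, count_scorer("science")),
]
per_categorizer = run_categorizers(["goal", "team", "atom"], categorizers)
print(combine_goodness(per_categorizer))
```

Averaging treats a category that a categorizer did not score as having a level of goodness of zero; other mechanisms, such as taking the maximum or a weighted sum, would fit the same interface.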

11. A method for categorization of an item comprising: 

providing a plurality of categories organized in a hierarchy of categories and having 
respective lists of category features using a categorizer system knowledge base 
for determining the lists of category features; 

providing a plurality of categorizers corresponding to one of the plurality of 
categories; 

featurizing the item to create a list of item features;

using the list of item features in a categorizer system including the plurality of
categorizers with the lists of category features to respectively determine a
plurality of levels of goodness, the plurality of levels of goodness determined
using a process to quantify the plurality of levels of goodness, to prioritize the
plurality of levels of goodness, and to resolve two levels of goodness into a
third level of goodness;

using one of the plurality of levels of goodness for invoking an additional categorizer
of the plurality of categorizers as required;

categorizing the item in the categorizer system in the plurality of categories based on
the respective plurality of levels of goodness;

listing the plurality of categories and the respective levels of goodness on a list; and

returning a category for the item from the list.

12. A method for categorization of a document comprising: 
providing a plurality of categories organized in a hierarchy of categories; 
providing a plurality of categorizers corresponding to the plurality of categories; 
featurizing the document to create a list of document features;

using the list of document features in a categorizer system including the plurality of
categorizers for determining a plurality of levels of goodness;

using one of the plurality of levels of goodness for invoking an additional categorizer
of the plurality of categorizers as required;

categorizing the document in categorizer system in the plurality of categories based
on the respective plurality of levels of goodness; and

returning a category for the document.

13. The method as claimed in claim 12 wherein:

determining the plurality of levels of goodness includes using a process selected from 
a group consisting of Naive Bayes, quantitative decision-tree classifiers such 
as C4.5, Bayesian networks, rule-based multi-class classifiers that output a
degree of goodness, conditional probability statements, simple heuristics, and
a combination thereof. 

14. The method as claimed in claim 12 including:

using a categorizer system knowledge base for determining the level of goodness for a
category with the list of document features.




Docket No. 10010076-1 



15. The method as claimed in claim 12 including:

listing the plurality of categories as the document is compared and the respective
levels of goodness on a list; and
categorizing from the list.

16. The method as claimed in claim 12 wherein: 

returning one category for the document among the plurality of categories selected 
from a group consisting of the one category with the best level of goodness for 
all the plurality of categories and with the best level of goodness for which
determining is completed where all of the plurality of categories are not
compared.

17. The method as claimed in claim 12 wherein: 

returning a plurality of categories for the document among the plurality of categories 
returns a plurality of categories selected from a group consisting of categories
up to a fixed number of the plurality of categories, categories having more
than a fixed level of goodness, categories fulfilling a user specified preference,
categories not from a categorizer, and categories which are a combination
thereof.

18. The method as claimed in claim 12 wherein: 

returning the category for a plurality of documents establishes a categorizer system 
knowledge base for a topic hierarchy. 

19. The method as claimed in claim 12 including:

listing a plurality of labels for each of the plurality of categories; and

training a categorizer system trainer using a plurality of documents having known
categories and the plurality of labels to provide a categorizer system
knowledge base.

20. The method as claimed in claim 12 including:
providing a categorizer system knowledge base;

using a plurality of documents with known categories to learn knowledge in the 
categorizer system knowledge base. 

21. The method as claimed in claim 12 including: 
providing a categorizer system knowledge base; 

providing a plurality of categorizers, each using knowledge in a categorizer system 
knowledge base and the list of document features to compute a degree of
goodness for a plurality of categories, independent of other categorizers, each
using a subset of document features to compute a degree of goodness for a
plurality of categories, independent of other categorizers, and each subset
independent of subsets used by other categorizers; and
providing a mechanism to resolve the levels of goodness for a plurality of categories
resulting from multiple categorizers into a combined level of goodness for a
plurality of categories.

22. A method for categorization of a document comprising:

providing a plurality of categories organized in a hierarchy of categories and having
respective lists of category features using a categorizer system knowledge base
resulting from determining a plurality of documents for determining the lists
of category features;

providing a plurality of categorizers corresponding to the plurality of categories;

featurizing the document to create a list of document features;

using the list of document features in a categorizer system including the plurality of
categorizers with the lists of category features to respectively determine a
plurality of levels of goodness;

using one of the plurality of levels of goodness for invoking an additional categorizer
of the plurality of categorizers as required;

categorizing the document in categorizer system including the plurality of
categorizers in the plurality of categories based on the respective plurality of
levels of goodness from the list;

listing the plurality of categories as the document is compared and the respective
levels of goodness on a list; and

returning a category for the document from the list.

23. A system for categorization of an item comprising: 

a plurality of categories organized in a hierarchy of categories;

a plurality of categorizers corresponding to the plurality of categories; 

a featurizer for featurizing the item to create a list of item features; 

a categorizer system including the plurality of categorizers using the list of item 
features for determining a plurality of levels of goodness, the categorizer
system for categorizing the item in the plurality of categories based on the
respective plurality of levels of goodness, the categorizer system for using one
of the plurality of levels of goodness for invoking an additional categorizer as
required; and

a return for returning the item categorized.

24. The system as claimed in claim 23 wherein:

determining the plurality of levels of goodness includes using a process to quantify
the plurality of levels of goodness, to prioritize the plurality of levels of
goodness, and to resolve two levels of goodness into a third level of goodness.

25. The system as claimed in claim 23 including:

a categorizer system knowledge base for determining the level of goodness for a
category with the list of item features.

26. The method as claimed in claim 23 including:

a categorizer system trainer trained using a plurality of items having known categories 
and the plurality of labels to provide a categorizer system knowledge base. 

27. A system for categorization of an item comprising: 

a categorizer system knowledge base having a plurality of categories organized in a 

hierarchy of categories and having respective lists of category features; 
a featurizer for featurizing the item to create a list of item features; and
a categorizer system connected to the categorization system knowledge base
including:

a plurality of categorizers having one of the plurality of categories, the
plurality of categorizers for using the list of item features with the lists
of category features to respectively determine a plurality of levels of
goodness, the plurality of categorizers categorizing the item in the
categorizer system in the plurality of categories based on the respective
plurality of levels of goodness,
a mechanism for using one of the plurality of levels of goodness for invoking
an additional categorizer of the plurality of categorizers as required;
and

a return for returning the item categorized. 

28. The system as claimed in claim 27 wherein: 

the plurality of categorizers determine the plurality of levels of goodness using a
process to quantify the plurality of levels of goodness, to prioritize the
plurality of levels of goodness, and to resolve two levels of goodness into a
third level of goodness.

29. The system as claimed in claim 27 wherein:
the categorizer system knowledge base determines the lists of category features. 

30. The system as claimed in claim 27 wherein:

the plurality of categorizers include a list mechanism for listing the plurality of
categories and the respective levels of goodness; and
the plurality of categorizers categorizes from the list mechanism.

31. The system as claimed in claim 27 wherein:

the return returns one category for the item among the plurality of categories selected
from a group consisting of the one category with the best level of goodness for
all the plurality of categories and with the best level of goodness for which
determining is completed where all of the plurality of categories are not
compared.

32. The system as claimed in claim 27 wherein:

the return returns a plurality of categories for the item among the plurality of
categories returns a plurality of categories selected from a group consisting of
categories up to a fixed number of the plurality of categories, categories
having more than a fixed level of goodness, categories fulfilling a user
specified preference, categories not from a categorizer, and categories which
are a combination thereof.

33. The system as claimed in claim 27 wherein: 

the return returns the category for a plurality of items to the categorizer system 
knowledge base for building a topic hierarchy. 

34. The system as claimed in claim 27 including: 

a further listing mechanism for listing a plurality of labels for each of the plurality of 
categories; and

a categorizer system trainer trained using a plurality of items having known categories
and the plurality of labels to provide the categorizer system knowledge base. 

35. A system for categorization of an item comprising:

a categorizer system knowledge base having a plurality of categories having 

respective lists of category features; 
a featurizer for featurizing the item to create a list of item features; and 
a categorizer system connected to the categorizer system knowledge base including:

a plurality of categorizers having the plurality of categories, the plurality of
categorizers for determining the list of item features with the lists of
category features to respectively determine a plurality of levels of
goodness, the plurality of categorizers categorizing the item in the
categorizer system in the plurality of categories based on the respective
plurality of levels of goodness,

a mechanism for using one of the plurality of levels of goodness for invoking
an additional categorizer of the plurality of categorizers as required,

a listing mechanism for listing the plurality of categories and the respective
levels of goodness on a list, and

a return for returning a category for the item from the list.